Queensland University of Technology ePrints Archive

University of Queensland eSpace

Optimal constraint-based decision tree induction from itemset lattices

Author: A Lew
A Machanavajjhala
A Moore
B Ganter
C Bucila
C Nadeau
Elisa Fromont
F Bonchi
G Blanchard
H Schumacher
HA Chipman
HJ Payne
IH Witten
J-F Boulicaut
JR Quinlan
L Breiman
L Hyafil
L Sweeney
MJ Zaki
MN Garofalakis
MR Garey
N Pasquier
P Samarati
P Turney
S Esmeir
Siegfried Nijssen
T Imielinski
W Buntine
Publication venue: Springer
Publication date: 01/01/2010

In this article we show that there is a strong connection between decision tree learning and local pattern mining. This connection allows us to solve the computationally hard problem of finding optimal decision trees in a wide range of applications by post-processing a set of patterns: we use local patterns to construct a global model. We exploit the connection between constraints in pattern mining and constraints in decision tree induction to develop a framework for categorizing decision tree mining constraints. This framework allows us to determine which model constraints can be pushed deeply into the pattern mining process, and allows us to improve the state of the art in optimal decision tree induction.

Simplivariate Models: Ideas and First Examples

Author: Age K. Smilde
AK Smilde
BGM Vandeginste
EJ Want
H Turner
HA Chipman
HL Turner
JA Hageman
JC Lindon
Johan A. Westerhuis
Jos A. Hageman
L Lazzeroni
Margriet M. W. B. Hendriks
Mariët J. van der Werf
Mark Isalan
MJ van der Werf
O Fiehn
R Bro
RA van den Berg
Ruud Berger
Publication venue: Public Library of Science
Publication date: 01/01/2008

One of the new expanding areas in functional genomics is metabolomics: measuring the metabolome of an organism. Data generated in metabolomics studies are very diverse in nature, depending on the design underlying the experiment. Traditionally, variation in measurements is conceptually broken down into systematic variation and noise, where the latter contains, e.g., technical variation. There is increasing evidence that this distinction does not hold (or is too simple) for metabolomics data. A more useful distinction is in terms of informative and non-informative variation, where informative relates to the problem being studied. In most common methods for analyzing metabolomics (or any other high-dimensional x-omics) data this distinction is ignored, thereby severely hampering the results of the analysis. This leads to poorly interpretable models and may even obscure the relevant biological information. We developed a framework from first data analysis principles by explicitly formulating the problem of analyzing metabolomics data in terms of informative and non-informative parts. This framework allows for flexible interactions with the biologists involved in formulating prior knowledge of underlying structures. The basic idea is that the informative parts of the complex metabolomics data are approximated by simple components with a biological meaning, e.g., in terms of metabolic pathways or their regulation. Hence, we termed the framework ‘simplivariate models’, which constitutes a new way of looking at metabolomics data. The framework is given in its full generality and exemplified with two methods, IDR analysis and plaid modeling, that fit into the framework. Using this strategy of ‘divide and conquer’, we show that meaningful simplivariate models can be obtained using a real-life microbial metabolomics data set. For instance, one of the simple components contained all the measured intermediates of the Krebs cycle of E. coli. Moreover, these simplivariate models were able to uncover regulatory mechanisms present in the phenylalanine biosynthesis route of E. coli.

Wageningen University & Research Publications

International Migration, Integration and Social Cohesion online publications

UvA-DARE

Genome-wide prediction using Bayesian additive regression trees

Author: A Onogi
C Hans
C Strobl
G Campos de los
G Morota
H Ishwaran
HA Chipman
J Bleich
J Fan
JH Friedman
L Breiman
L Fahrmeir
M Szydłowski
MA Cleveland
MI Jordan
MT Pratola
N Heslot
O González-Recio
P Waldmann
Patrik Waldmann
R Diaz-Uriarte
R Genuer
R Howard
R Zhu
RA Fisher
S Cabras
S Okser
S Theodoridis
T Hastie
T Park
THE Meuwissen
WG Hill
WG Touw
WY Loh
X Chen
Y LeCun
Z Ghahramani
Publication venue: Springer Science and Business Media LLC

arXiv.org e-Print Archive

Design of Experiments for Screening

Author: A Boukouvalas
A Marrel
A Miller
A Saltelli
A.E. Vine
AB Owen
AC Atkinson
AM Dean
B Abraham
B Bettonvil
B. Tang
BA Jones
C Daniel
C Linkletter
C.F.J. Wu
CA Mauro
CE Rasmussen
CJ Marley
CR Rao
CS Cheng
D Draguljić
D Dupuy
D Scott-Drechsel
D. Xing
D.T. Voss
DA Bulutoglu
DJ Finney
DKJ Lin
EI George
F Campolongo
F Satterthwaite
FKH Phoa
G Damblin
G Pujol
G.S. Watson
GEP Box
GM James
H Moon
H. Wan
H. Xu
H. Yang
H.B.E. Wan
HA Chipman
JL Loeppky
JPC Kleijnen
K.Q. Ye
KHV Booth
KJ Ryan
KP Burnham
KT Fang
L Pronzato
L. Xiao
M Claeys-Bruno
M Hamada
M Johnson
M Liu
M.A. Wolters
MD McKay
MD Morris
MJ Hall
N Durrande
NA Butler
NK Nguyen
PR Scinto
PZG Qian
R Dorfman
R Jin
R Joseph
RB Gramacy
RK Meyer
RL Iman
RL Plackett
RV Lenth
S Ba
SC Cotter
SG Gilmour
SM Lewis
TJ Santner
VE Bowman
W DuMouchel
W Li
W.J. Welch
WA Brenneman
WW Li
X Qu
Y Benjamini
Y Liu
Publication date: 18/10/2015

The aim of this paper is to review methods of designing screening experiments, ranging from designs originally developed for physical experiments to those especially tailored to experiments on numerical models. The strengths and weaknesses of the various designs for screening variables in numerical models are discussed. First, classes of factorial designs for experiments to estimate main effects and interactions through a linear statistical model are described, specifically regular and nonregular fractional factorial designs, supersaturated designs and systematic fractional replicate designs. Generic issues of aliasing, bias and cancellation of factorial effects are discussed. Second, group screening experiments are considered, including factorial group screening and sequential bifurcation. Third, random sampling plans are discussed, including Latin hypercube sampling and sampling plans to estimate elementary effects. Fourth, a variety of modelling methods commonly employed with screening designs are briefly described. Finally, a novel study demonstrates six screening methods on two frequently used exemplars, and their performances are compared.

Changes in deep-water CO2 concentrations over the last several decades determined from discrete pCO2 measurements

Author: Anderson
Brewer
Bullister
Byrne
Chipman
DOE
Doney
Garzoli
Gent
Geun-Ha Park
Gouretski
Gruber
Hoppema
Ilyina
John L. Bullister
Johnson
Key
Khatiwala
Körtzinger
Lamb
Levine
Lewis
Marion
McElligott
Mercier
Moore
Neill
Peng
Pierrot
Purkey
Pérez
Richard A. Feely
Rik Wanninkhof
Ríos
Sabine
Scott C. Doney
Sloyan
Takahashi
Tanhua
Taro Takahashi
van Heuven
Vázquez-Rodríguez
Wanninkhof
Weiss
Publication venue: Elsevier BV
Publication date: 01/01/2013

This paper is not subject to U.S. copyright. The definitive version was published in Deep Sea Research Part I: Oceanographic Research Papers 74 (2013): 48-63, doi:10.1016/j.dsr.2012.12.005.

Detection and attribution of hydrographic and biogeochemical changes in the deep ocean are challenging due to the small magnitude of their signals and to limitations in the accuracy of available data. However, there are indications that anthropogenic and climate change signals are starting to manifest at depth. The deep ocean below 2000 m comprises about 50% of the total ocean volume, and changes in the deep ocean should be followed over time to accurately assess the partitioning of anthropogenic carbon dioxide (CO2) between the ocean, terrestrial biosphere, and atmosphere. Here we determine the changes in the interior deep-water inorganic carbon content by a novel means that uses the partial pressure of CO2 measured at 20 °C, pCO2(20), along three meridional transects in the Atlantic and Pacific oceans. These changes are measured on decadal time scales using observations from the World Ocean Circulation Experiment (WOCE)/World Hydrographic Program (WHP) of the 1980s and 1990s and the CLIVAR/CO2 Repeat Hydrography Program of the past decade. The pCO2(20) values show a consistent increase in deep water over the time period. Changes in total dissolved inorganic carbon (DIC) content in the deep interior are not significant or consistent, as most of the signal is below the level of analytical uncertainty. Using an approximate relationship between pCO2(20) and DIC change, we infer DIC changes that are at the margin of detectability. However, when integrated on the basin scale, the increases range from 8–40% of the total specific water column changes over the past several decades. Patterns in chlorofluorocarbons (CFCs), along with output from an ocean model, suggest that the changes in pCO2(20) and DIC are of anthropogenic origin.

Rik Wanninkhof, Geun-Ha Park, John L. Bullister, and Richard A. Feely appreciate the support from the NOAA Office of Atmospheric and Oceanic Research and the Climate Observation Division. S.C.D. acknowledges support from NOAA Grant NA07OAR4310098. T.T. has been supported by grants from NSF and NOAA.

Woods Hole Open Access Server

Columbia University Academic Commons

Bayesian Wavelet Shrinkage of the Haar-Fisz Transformed Wavelet Periodogram.

Author: A Davison
A Sawczenko
B Vidakovic
C Semadeni
CH Page
DL Donoho
F Ruggeri
GP Nason
Guy Nason
HA Chipman
I Daubechies
IA Eckley
IM Johnstone
Kara Stevens
Leontios Hadjileontiadis
M Pensky
MA Clyde
MB Priestley
P Fryzlewicz
P Müller
R Dahlhaus
RA Silverman
RH Byrd
RR Coifman
S Barber
S Van Bellegem
Publication venue: Public Library of Science (PLoS)
Publication date: 01/01/2015

It is increasingly being realised that many real world time series are not stationary and exhibit evolving second-order autocovariance or spectral structure. This article introduces a Bayesian approach for modelling the evolving wavelet spectrum of a locally stationary wavelet time series. Our new method works by combining the advantages of a Haar-Fisz transformed spectrum with a simple, but powerful, Bayesian wavelet shrinkage method. Our new method produces excellent and stable spectral estimates, as demonstrated on simulated data and on differenced infant electrocardiogram data. A major additional benefit of the Bayesian paradigm is that we obtain rigorous and useful credible intervals of the evolving spectral structure. We show how the Bayesian credible intervals provide extra insight into the infant electrocardiogram data.

Plymouth Electronic Archive and Research Library

Explore Bristol Research

Altered Trabecular Bone Structure and Delayed Cartilage Degeneration in the Knees of Collagen VI Null Mice

Author: A Chevrier
AK Lampe
BD Furman
BD Ward
Bridgette D. Furman
C Söderhäll
CA Poole
D Felson
DA Walters
DF Bray
DF Ogletree
DM Allen
DR Keene
E Bertini
EA Kennedy
EBW Giesen
EM Darling
F Guilak
Farshid Guilak
G Jay Jr
G Pepe
GJ van Osch
GM Wildey
H Lam
HA Leddy
Hani A. Awad
HJ Kuo
HL Glansbeek
Holly A. Leddy
J Chang
J Coles
J Fitzgerald
J Gillquisl
J Marcelino
J Nieminen
J Zeichen
JB Choi
Jeffrey M. Coles
JM Coles
K Blumbach
K Gelse
K Higashi
K Hu
KJ Carlson
KM Kwan
L Cao
L Han
L Merlini
LA Pottenger
LA Setton
LG Alexopoulos
M Stolz
M Zhu
N Tanaka
Nicole A. Zelenski
P Bonaldo
Paolo Bonaldo
Q Kong
R Wagener
S Chiravarambath
S Harumiya
S Park
S Petrini
S Tsukahara
SD Chipman
SK Gara
Stefan Zauscher
Susan E. Christensen
T Damrongrungruang
T Hildebrand
T Tanaka
TH Smit
TM Griffin
U Kubitscheck
U Specks
VC Mow
WA Irwin
WG Beamer
Y Miyamoto
Publication venue: Public Library of Science
Publication date: 01/01/2012

Mutation or loss of collagen VI has been linked to a variety of musculoskeletal abnormalities, particularly muscular dystrophies, tissue ossification and/or fibrosis, and hip osteoarthritis. However, the role of collagen VI in bone and cartilage structure and function in the knee is unknown. In this study, we examined the role of collagen VI in the morphology and physical properties of bone and cartilage in the knee joint of Col6a1−/− mice by micro-computed tomography (microCT), histology, atomic force microscopy (AFM), and scanning microphotolysis (SCAMP). Col6a1−/− mice showed significant differences in trabecular bone structure, with lower bone volume, connectivity density, trabecular number, and trabecular thickness but higher structure model index and trabecular separation compared to Col6a1+/+ mice. Subchondral bone thickness and mineral content increased significantly with age in Col6a1+/+ mice, but not in Col6a1−/− mice. Col6a1−/− mice had lower cartilage degradation scores, but developed early, severe osteophytes compared to Col6a1+/+ mice. In both groups, cartilage roughness increased with age, but neither the frictional coefficient nor compressive modulus of the cartilage changed with age or genotype, as measured by AFM. Cartilage diffusivity, measured via SCAMP, varied minimally with age or genotype. The absence of type VI collagen has profound effects on knee joint structure and morphometry, yet minimal influences on the physical properties of the cartilage. Together with previous studies showing accelerated hip osteoarthritis in Col6a1−/− mice, these findings suggest different roles for collagen VI at different sites in the body, consistent with clinical data.